ISSS608 G1 Group 5


Chen Yuxi 1

Lim Yong Kai 1

Jovinka Hartanto 1

1 Singapore Management University

Introduction

The VAST 2021 Mini Challenge 2 outlines a hypothetical scenario in which several GAStech employees have gone missing and the organisation, Protectors of Kronos (POK), is suspected of being involved. The challenge requires identification of suspicious activities hidden in data and determine any dubious people and locations that should be reported to the law enforcement using interactive visual analytical tools.

Motivation and Objectives

Our research and development effort aims to enable Kronos law enforcement with the ability to easily analyse, drill down and identify key suspects and suspicious locations, and thereby speeding up the investigation process.

The use cases of the data visualisation tool include but are not limited to the following:

  • The most popular locations and when they are popular
  • Infer the owner of each credit card and loyalty card
  • Identify potential informal or unofficial relationships among GASTech personnel
  • Analyze unusual and suspicious activities of the employees
  • Analyze daily routines of GasTech employees

The application hat aims to provide users actionable insights based on the following analyses:

  • Exploratory Data Analysis (EDA) of GPS tracking data and credit and loyalty card transaction patterns
  • Visualisation of employee movements over time and associated purchase transactions
  • Network analysis of employees based on their visited locations

Approach and Methodology

Application

The Application was designed using R Shiny and hosted on shinyapps.io server. The application has 5 main section with various interactive visual analytics tools to allow users to investigate the data. The user guide to the application is available at https://isss608g1group5.netlify.app/userguide.html.

Abila Kronos Map

Add desc and picture here later!!!

Locations Transactions

The tab allow users to investigate GASTech employee’s credit card transactions at the various locations made in Abila and three interactive visualisations are available in this tab. All three interactive visualisation share five filters where users can select by location, employee, date, time period of transaction and department of each employee to dynamically update the visualisations.

Heatmap of Location Visit Frequency

Figure 1 provides an overview of the frequency of transactions at each location by date. Hovering over each region will show the micro-data of frequency count for that location by date.
Heatmap

Figure 1: Heatmap

Boxplot of Credit Card Transactions

Figure 2 displays each credit card transaction price at varying locations. Hovering over the boxplot will display the micro-data of the median price for the location and the corresponding credit card number and transaction price.
Boxplot

Figure 2: Boxplot

Barplot of Number of Transactions per Day per Location

Figure 2 displays the number of transactions for each location by date and colored by time period. Hovering over the columns displays the micro-data of the number of transactions performed at the location on the date during the time period.
Stacked Barplot

Figure 3: Stacked Barplot

Card Mapping

Figure 4 displays the credit card and loyalty card mapping to each GASTech employee with reference to their car GPS data. Users can select the different credit card number in the filter to dynamic change the plot to observe the respective transactions made at the different locations. The difference in price of each matching credit card and loyalty card is also made available in the plot. Users can utilise the interactive plot to highlight different selections throughout the parallel coordinate plot.
Parallel Coordinate Plot

Figure 4: Parallel Coordinate Plot

Network Analysis

Network analysis depicts the movement of GASTech employees to corresponding locations. There are two network analysis presented based on their credit card transaction history or on their car GPS movement and locations. Figure 5 displays the network analysis based on credit card transactions. Each node represents an employee or location and each edge represents the connection between each nodes. Each visit is represented by 1 edge line and repeated visit will depict a thicker edge line. Users can click the nodes on the interactive plots to highlight only the edge lines linked to the node. Users can also select department to highlight the nodes of employees from the department.
Network Analysis based on Credit Card Transactions

Figure 5: Network Analysis based on Credit Card Transactions

Future Work

Conclusion

Through the utilisation of interactive visual analytical tools in the Shiny application, users can visualise patterns and anomaly activities of the employees. The user-friendly layout incorporated with drop down filters allow users to slice the data and utilise the interactivity of each visusalisation to investigate the data without knowledge of programming or data analytics.

Acknowledgement

We would like to thank Professor Kam Tin Seong for his guidance throughout the tenure of the project.

STAGEM: Saptiotemporal Analysis of Gastech Employees’ Movement Data